Requested Patent: EP0744716A2 



Title: METHOD AND APPARATUS FOR AUTHENTICATING DOCUMENTS ; 

Abstracted Patent: EP0744716 ; 

Publication Date: 1996-11-27 ; 

Inventor(s): CUMMING HEATHER L (GB) ; 

Applicant(s): NCR INT INC (US) ; 

Application Number: EP19960303493 19960516 ; 

Priority Number(s): GB19950010678 19950525 ; 

IPC Classification: G07D7/00 ; 

Equivalents: DE69605854D, DE69605854T, ES2140787T, JP9016777, ZA9603604 ; 

ABSTRACT: 

In a method and apparatus for authenticating documents, at least one small area (22) on a 
document (12) being tested is sensed by a spectroscope (26), and the light intensity at a plurality 
(e.g. 50) of spectral points is measured by a photodiode array (30). The resulting signals are 
digitized in an analog-to-digital converter (38) and the resulting data is stored. This data is then 
analyzed by discriminant analysis, whereby the intensity values treated as components of a vector 
are multiplied by one or more sets of weighting coefficients (discriminant functions) to provide a 
set of discriminant function values. A distance measurement between this set of values and the 
centroid of a corresponding set of values for genuine documents is compared with a threshold 
value to determine the authenticity of the document (12). 



Description 

This invention relates to a method and apparatus 
for authenticating documents. 

Automatic machines which accept banknotes and other 
valuable documents such as cheques are becoming more 
widely used. It is important for such machines to 
authenticate the documents, that is, to distinguish 
between genuine and counterfeit documents. 

U.K. Patent Application No. GB-A-2 192 275 dis- 
closes a system for authenticating banknotes by detect- 
ing colours thereof by reflected or transmitted light. Op- 
tical fibre bundles direct light from a light source onto 
the banknote, and the reflected or transmitted light is 
incident on a plurality of colour filters which pass the light 
they transmit to respective further optical fibres which 
transmit the light to respective photosensors. The output 
signals from the photosensors are analysed to deter- 
mine the authenticity of the banknote, by comparing da- 
ta representing the detected signals or signal ratios with 
corresponding reference data derived from a genuine 
banknote. This known system is based on a comparison 
technique and has the disadvantage of requiring the 
storage of large amounts of reference data. 

It is an object of the present invention to provide a 
method and apparatus for authenticating documents, 
which is capable of authenticating documents in an ef- 
ficient manner, yet requires only a small amount of 
stored reference data. 

Therefore, according to the present invention, there 
is provided a method of determining the authenticity of 
a document, characterized by the steps of: dispersing 
light derived from an area of said document into a spec- 
trum, generating a plurality of electrical signals repre- 
senting light intensity values in a corresponding plurality 
of spectral wavebands in said spectrum, storing data 
representing said electrical signals, and analyzing the 
stored data by discriminant analysis to determine the 
authenticity of said document. 

It is found that the use of discriminant analysis to 
determine the authenticity of documents results in good 
classification of documents as authentic or non-authen- 
tic, with only a low rate of misclassification, while involv- 
ing the storage of only a low quantity of reference data. 

One embodiment of the present invention will now 
be described by way of example, with reference to the 
accompanying drawings, in which: - 

Fig. 1 is a diagram of a document authentication 
system according to the present invention; 
Fig. 2 is a plot illustrating the classification of docu- 
ments; and 

Fig. 3 is a flowchart showing the process utilized by 
the system shown in Fig. 1 for determining the au- 
thenticity of a document. 

Referring to Fig. 1, there is shown a simplified block 
diagram of a document authentication system 10. A 
document 12, whose authenticity is to be determined, is fed 
by document transport means 14 to a sensing station 16, 
where the document 12 is maintained in a stationary state 
for a time sufficient to sense the document in a manner to 
be described. Alternatively, the document could be placed 
manually in the sensing station 16. Located at the sensing 
station 16 is a broadband (white) light source 18 which 
directs a narrow, collimated beam of light over a light 
path 20 to illuminate a small circular area 22 on the 
document 12. Light from the area 22 passes via a light path 
24 to a spectroscope 26 which disperses the incident light 
into a spectrum output beam 28, in the wavelength range 
from 400 to 900 nm, for example. The spectroscope 26 may be 
a standard, commercially available spectroscope. 

The dispersed light beam 28 is applied to a photodiode 
array 30. The number of photodiodes in the array 30 may 
depend on the application. In one example, 50 photodiodes 
are provided, thereby producing electrical signals 
representing incident light intensity on a corresponding 
number of output lines 32, which are connected via 
respective amplifiers 34 to a multiplexer 36. However, the 
number of photodiodes in the array 30 is not a limitation, 
and more, or fewer, than 50 photodiodes may be utilized. 
Also, signals derived from a relatively large number, e.g. 
250, of sensors may be compressed by using a computer 
program to a smaller number, e.g. 50, of points per 
spectrum. It is found that good classification results may 
be achieved with as few as 15 spectral points, for example. 
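
By way of illustration only, the compression of a larger number of 
sensor signals to a smaller number of points per spectrum might be 
sketched as follows. The description does not state how the 
compression is performed; averaging contiguous groups of readings is 
an assumption made here, and the name compress_spectrum and the 
placeholder readings are hypothetical. 

```python
def compress_spectrum(samples, n_points):
    """Average contiguous groups of sensor readings down to n_points values.

    Assumption: simple bin averaging; the source does not specify the method.
    """
    if n_points <= 0 or len(samples) < n_points:
        raise ValueError("need at least n_points samples")
    compressed = []
    for k in range(n_points):
        # Integer bin edges; bin sizes differ by at most one sample.
        start = k * len(samples) // n_points
        stop = (k + 1) * len(samples) // n_points
        group = samples[start:stop]
        compressed.append(sum(group) / len(group))
    return compressed


# Placeholder readings (not measured data): 250 sensors reduced to 50 points.
raw = [float(i) for i in range(250)]
spectrum = compress_spectrum(raw, 50)
```

With 250 readings and 50 points, each output value is the mean of five 
adjacent readings. 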

The output of the multiplexer 36 is applied to an an- 
alog-to-digital converter 38 which provides, on a serial 
output 40, digital data representing the light intensities 
incident on the respective photodiodes of the photodi- 
ode array 30. This data is stored in a memory 42 which 
is connected to a processor 44 which processes the da- 
ta using the statistical technique of discriminant analy- 
sis, in a manner to be described, utilizing reference data 
stored in a memory 46, and provides an output signal 
on a line 48, identifying the document 12 as genuine or 
counterfeit. 

As mentioned, the processor 44 operates on the data in 
accordance with the statistical technique of discriminant 
analysis. It is assumed that the documents being tested for 
authenticity are all of the same document type. For 
example, if the documents are banknotes, it is assumed that 
the banknotes are all issued by the same issuing bank and 
are of the same denomination; for instance, the documents 
12 may be Bank of England ten pound notes. It will be 
appreciated that if the apparatus 10 is located in a 
machine capable of accepting various types of banknotes, 
for example, then an initial recognition step may be 
required to recognize the document type, and provide a 
signal to access the appropriate reference data stored in 
the memory 46. 

Samples of the particular document type, e.g. Bank of 
England ten pound notes, are utilized in a preliminary 
procedure to calculate discriminant functions for use in 
the reference data memory 46. In one embodiment, it is 
assumed that banknotes of three classes, namely genuine 
notes, colour photocopied notes and forged notes (other 
than colour photocopied notes) are available, wherein 
forged notes have been produced by printing procedures more 
sophisticated than colour photocopying. In one example, 200 
genuine notes, 100 colour photocopied notes, and 15 forged 
notes were utilized, although these figures are not a 
limitation and other numbers of samples may be utilized. 
The colour photocopied notes and the forged notes are 
examples of counterfeit notes. All the sample notes are fed 
in turn to a sensing station, similar to the sensing 
station 16 (Fig. 1), and digital light intensity samples at 
the same number of spectrum sampling points are produced in 
a manner similar to that described with reference to Fig. 
1, resulting in stored data which can be regarded 
mathematically as a vector corresponding to each banknote 
sample, the vectors having 50 components, i.e. being vectors 
in 50-dimensional space. If these vectors are regarded as 
points in such 50-dimensional space, it should be 
understood that the points corresponding to the class of 
genuine notes are clustered together, the points 
corresponding to the class of colour photocopied notes are 
clustered together, and the points corresponding to the 
class of forged notes are clustered together. Thus, there 
are three classes of clustered points. 

A description of the statistical technique of discriminant 
analysis can be found, for example, in the book by 
R.O. Duda and P.E. Hart: "Pattern Classification and 
Scene Analysis," John Wiley & Sons, 1973, at pages 
114-121. Briefly, the technique aims at "projecting" the 
points in the high (e.g. 50) dimensional space to a lower 
dimensional space which is of a dimension one less than 
the number of classes, i.e. where there are three classes, 
to two-dimensional space, while retaining a high degree of 
clustering, corresponding to the original clustering. For 
this purpose, functions are computed which maximize the 
ratio of between-class scatter to within-class scatter. 
Thus, for example, the projection from 50-dimensional 
space to 2-dimensional space is accomplished by two 
discriminant functions. Mathematically, this corresponds 
to the equations: 

$$y_1 = \sum_{i=1}^{50} w_{i1} x_i$$ 

$$y_2 = \sum_{i=1}^{50} w_{i2} x_i$$ 



where the $x_i$ (i=1,...,50) are the digitized spectral 
intensity components, $w_{i1}$ (i=1,...,50) and $w_{i2}$ 
(i=1,...,50) are the two sets of discriminant function 
coefficients, and $y_1$ and $y_2$ are the projected 
discriminant function values in 2-dimensional space of the 
50-dimensional vector $x_i$ (i=1,...,50). A procedure for 
computing discriminant functions is set forth in the 
aforementioned Duda and Hart textbook reference, for 
example. The discriminant functions $w_{i1}$ and $w_{i2}$ 
(i=1,...,50) are stored. 
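
The weighted sums defined above may be illustrated by the following 
minimal sketch. The function name discriminant_values and the 
three-component example vector are hypothetical and are used only to 
keep the arithmetic easy to check; the described embodiment uses 50 
components and two coefficient sets. 

```python
def discriminant_values(x, coefficient_sets):
    """Return one weighted sum y_j = sum_i w_ij * x_i per coefficient set."""
    values = []
    for w in coefficient_sets:
        if len(w) != len(x):
            raise ValueError("coefficient set and intensity vector differ in length")
        values.append(sum(wi * xi for wi, xi in zip(w, x)))
    return values


# Hypothetical 3-component intensity vector and two coefficient sets.
x = [1.0, 2.0, 3.0]
w_sets = [
    [0.5, 0.0, -0.5],  # coefficients w_i1
    [1.0, 1.0, 1.0],   # coefficients w_i2
]
y1, y2 = discriminant_values(x, w_sets)
```

Each coefficient set yields one discriminant function value, so the 
number of sets fixes the dimension of the projected space. 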
It will be appreciated that each sample note gives 
rise to corresponding discriminant function values ($y_1$, 
$y_2$) in 2-dimensional space. The next step in the 
procedure is to calculate the mean (centroid) discriminant 
function values for the genuine notes. Referring to Fig. 
2, there is shown a plot of discriminant function values 
($y_1$, $y_2$) for the various sample notes. The 
discriminant function values for the genuine sample notes 
are shown as small solid circular areas; the discriminant 
function values for the colour photocopied sample notes are 
shown as crosses; and the discriminant function values for 
the forged sample notes are shown as small outline circles. 
It is seen that the discriminant function values are 
disposed in three clusters 60, 62 and 64, corresponding to 
the genuine sample notes, the colour photocopied sample 
notes and the forged sample notes respectively. It will be 
appreciated that Fig. 2 is simplified by not showing the 
full number of discriminant function values, for clarity. 
However, the clustering of the discriminant function 
values in three clusters 60, 62 and 64 is clearly seen. 

Next, the mean (centroid) values ($m_1$, $m_2$) of the 
discriminant function values for the genuine notes in the 
cluster 60 are calculated and stored. These values are 
represented by the point 66 shown in the plot of Fig. 2. 
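
The centroid calculation amounts to a component-wise mean, which may 
be sketched as follows. The function name centroid and the three 
sample points are hypothetical; they are not values taken from Fig. 2. 

```python
def centroid(points):
    """Return the component-wise mean of a list of equal-length points."""
    if not points:
        raise ValueError("at least one point is required")
    n = len(points)
    dims = len(points[0])
    return [sum(p[d] for p in points) / n for d in range(dims)]


# Hypothetical (y1, y2) values for three genuine sample notes.
genuine_values = [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
m1, m2 = centroid(genuine_values)
```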

It should be understood that there has now been 
computed, and stored, reference data in the form of the 
discriminant function coefficients $w_{i1}$ (i=1,...,50) 
and $w_{i2}$ (i=1,...,50) and the mean discriminant 
function values ($m_1$, $m_2$) for the genuine notes. Also, 
a threshold value T (to be explained) is entered and 
included in the reference data. This reference data may now 
be transferred to the memory 46, contained in the 
authentication system 10, Fig. 1, for testing the 
authenticity of an unknown banknote. For example, the 
reference data may be stored on a diskette which is 
transported to the location where an authentication system 
10 (Fig. 1) is installed. Copies of such diskette could be 
utilized to transfer the reference data to any locations 
where an authentication system such as the system 10 is 
situated. 
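
To indicate how little reference data is involved, the following 
sketch holds the coefficients, centroid and threshold in one record 
and counts the stored numbers. The record layout, the field names and 
the use of JSON as a transfer format are assumptions made for 
illustration; the description specifies only the content of the 
reference data, and the zero-valued coefficients are placeholders. 

```python
import json
from dataclasses import asdict, dataclass


@dataclass
class ReferenceData:
    coefficients: list  # one list of weights per discriminant function
    centroid: list      # mean discriminant values for genuine notes
    threshold: float    # threshold value T

    def stored_numbers(self):
        """Count the numeric values that make up the reference data."""
        return sum(len(w) for w in self.coefficients) + len(self.centroid) + 1


# Placeholder values only: two sets of 50 coefficients, a 2-D centroid, and T.
ref = ReferenceData(
    coefficients=[[0.0] * 50, [0.0] * 50],
    centroid=[0.0, 0.0],
    threshold=1.0,
)
payload = json.dumps(asdict(ref))               # text that could be copied to another site
restored = ReferenceData(**json.loads(payload)) # record rebuilt at the receiving site
```

For the described embodiment this gives 2 x 50 coefficients, two 
centroid values and one threshold, i.e. 103 numbers in total. 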

The manner in which a document 12 is tested for 
authenticity will now be described with reference to the 
flowchart 80 of Fig. 3. First, as shown in block 82, light 
from the small area 22 of the document 12 being tested is 
dispersed by the spectroscope 26 (Fig. 1), with the 
dispersed beam being sensed by the photodiodes 30, thereby 
generating the 50 intensity values which are digitized and 
stored. 

Next, as shown in block 86, the discriminant function 
values for the note 12 being tested are calculated using 
the discriminant function coefficients $w_{i1}$ 
(i=1,...,50) and $w_{i2}$ (i=1,...,50) stored in the 
reference data memory 46, thereby providing a pair of 
values corresponding to a point ($y_1$, $y_2$) on the plot 
shown in Fig. 2. Then, as shown in block 88, the distance 
of this point from the centroid discriminant function value 
point 66 is calculated. 

Finally, as shown in block 90, a comparison is made as to 
whether the calculated distance is less than the threshold 
value T, included in the reference data. If yes, then a 
signal is produced on the output line 48 (Fig. 1) 
indicative of the document 12 being authentic (block 92). 
If no, then the signal on the output line 48 is indicative 
of the document 12 being counterfeit (block 94). Referring 
to Fig. 2, it will be appreciated that the distance 
comparison effectively determines whether or not the point 
on the plot corresponding to the document 12 being tested 
lies inside the circle 96 having centre 66 and radius T. If 
the point lies inside the circle 96, then the document 12 
is determined to be authentic. If not, the document is 
determined to be non-authentic (counterfeit). 
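
The distance calculation and threshold comparison of blocks 88 and 90 
may be sketched as follows. The function name is_authentic and the 
numeric values are hypothetical; the sketch assumes only that the 
test point and the centroid have the same number of components. 

```python
import math


def is_authentic(y, centroid, threshold):
    """Return True if y lies strictly inside the circle (or sphere) of radius T."""
    distance = math.dist(y, centroid)  # Euclidean distance, Python 3.8+
    return distance < threshold


# Hypothetical values: centroid at the origin, threshold T = 2.0.
inside = is_authentic([1.0, 1.0], [0.0, 0.0], 2.0)
outside = is_authentic([3.0, 4.0], [0.0, 0.0], 4.0)
```

Because math.dist accepts points of any dimension, the same test 
applies unchanged where three discriminant functions are used and the 
decision region is a sphere. 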

It will be appreciated that if the document 12 is 
determined as non-authentic, the signal on the line 48 may 
be effective to return the document to an entry slot (not 
shown) or divert the document to a reject bin (not shown). 
If the document is determined to be authentic, a 
transaction can be performed. For example, if the document 
is a banknote, a financial transaction may be initiated. 

Modifications of the described embodiment are 
possible. For example, the number of classes of docu- 
ments may differ from the three classes utilized in the 
described embodiment (genuine, colour photocopies, 
other forged documents). Thus, there may be four class- 
es (new genuine banknotes, used genuine banknotes, 
colour photocopied banknotes, other forged ban- 
knotes). In this case there will be three discriminant 
functions, instead of two, and instead of the two-dimen- 
sional plot (Fig. 2) a three-dimensional plot will be pro- 
duced. The new genuine banknotes and used genuine 
banknotes produce respective clusters of discriminant 
function values which overlap, and the mean (centroid) 
of all these discriminant function values is taken as the 
point corresponding to the point 66 (Fig. 2) from which 
the distance is measured during the authentication pro- 
cedure for an unknown document. It will, of course, be 
appreciated that the circle 96 (Fig. 2) is replaced by a 
sphere and that authentic documents correspond to 
points within the sphere. 

In another modification, there may be only two 
classes of documents, namely genuine banknotes (new 
and used), and counterfeit banknotes (colour photocop- 
ied and other forged banknotes). In this modification, 
there is only one discriminant function and the discrimi- 
nant function values are arranged in two clusters along 
a straight line. 
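
For the two-class case, a single discriminant function can be 
obtained with the textbook two-class Fisher linear discriminant, in 
which the weight vector is the solution of S_w w = m_genuine - 
m_counterfeit, S_w being the within-class scatter matrix. The sketch 
below is an illustration of that standard technique under stated 
assumptions, not necessarily the procedure used in the described 
embodiment; all function names and the two-component sample data are 
hypothetical. 

```python
def mean_vector(samples):
    """Component-wise mean of a list of equal-length vectors."""
    n = len(samples)
    return [sum(s[i] for s in samples) / n for i in range(len(samples[0]))]


def scatter_matrix(samples, mean):
    """Sum of outer products of deviations from the class mean."""
    d = len(mean)
    s = [[0.0] * d for _ in range(d)]
    for x in samples:
        diff = [x[i] - mean[i] for i in range(d)]
        for i in range(d):
            for j in range(d):
                s[i][j] += diff[i] * diff[j]
    return s


def solve(a, b):
    """Solve a * w = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("within-class scatter matrix is singular")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    w = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(m[r][c] * w[c] for c in range(r + 1, n))
        w[r] = (m[r][n] - tail) / m[r][r]
    return w


def fisher_direction(genuine, counterfeit):
    """Weights w solving S_w w = m_genuine - m_counterfeit."""
    mg, mc = mean_vector(genuine), mean_vector(counterfeit)
    sg, sc = scatter_matrix(genuine, mg), scatter_matrix(counterfeit, mc)
    d = len(mg)
    sw = [[sg[i][j] + sc[i][j] for j in range(d)] for i in range(d)]
    return solve(sw, [mg[i] - mc[i] for i in range(d)])


# Hypothetical 2-component samples (not measured data).
genuine = [[1.0, 2.0], [2.0, 1.0], [2.0, 3.0], [3.0, 2.0]]
counterfeit = [[5.0, 6.0], [6.0, 5.0], [6.0, 7.0], [7.0, 6.0]]
w = fisher_direction(genuine, counterfeit)
project = lambda x: sum(wi * xi for wi, xi in zip(w, x))
genuine_mean_value = project(mean_vector(genuine))
counterfeit_mean_value = project(mean_vector(counterfeit))
```

Projecting each sample onto w gives a single value per note, so the 
two classes appear as two groups of points along one axis. 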

It will be appreciated that in the above-described 
embodiment and modifications, the distance measurement 
used to determine the distance between the discriminant 
function values of a document being tested and the 
centroid discriminant function values is the standard 
Euclidean distance measurement. As an alternative, the 
Mahalanobis distance could be used, in which case the 
decision curve or surface, corresponding to the circle 96 
or sphere discussed above, would be an ellipse or 
ellipsoid, with a document being characterized as authentic 
if its calculated discriminant function values correspond 
to a point inside the ellipse or ellipsoid. The concept of 
Mahalanobis distance is well known to those skilled in the 
pattern recognition art. For example, see page 24 of the 
aforementioned textbook by Duda and Hart for a discussion 
of the Mahalanobis distance concept. 
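
The effect of the Mahalanobis distance can be illustrated for the 
two-dimensional case as follows. The sketch assumes that a 2 x 2 
covariance matrix of the genuine notes' discriminant function values 
is available; the description does not say how that matrix is 
obtained, and the function name and numeric values are hypothetical. 

```python
def mahalanobis_squared_2d(y, mean, cov):
    """Squared Mahalanobis distance diff^T * inverse(cov) * diff in 2-D."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    if det == 0:
        raise ValueError("covariance matrix is singular")
    dx = y[0] - mean[0]
    dy = y[1] - mean[1]
    # inverse(cov) = (1/det) * [[d, -b], [-c, a]], expanded term by term.
    return (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det


# Hypothetical covariance: spread along y1 is larger than along y2.
cov = [[4.0, 0.0], [0.0, 1.0]]
mean = [0.0, 0.0]
along_y1 = mahalanobis_squared_2d([2.0, 0.0], mean, cov)
along_y2 = mahalanobis_squared_2d([0.0, 2.0], mean, cov)
```

The two test points are at the same Euclidean distance from the mean 
but at different Mahalanobis distances, which is why a fixed 
threshold on this measure traces an ellipse rather than a circle. 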

In yet another modification, instead of a single small 
area of the document 12 (Fig. 1) being tested being used to 
obtain the light intensity values used in the discriminant 
analysis procedure described hereinabove, a plurality of 
such small areas, for example three such areas, located at 
different points on the document being tested may be 
utilized. Thus, the light source 18, Fig. 1, may be 
controlled to direct light successively towards three 
different small areas of the document 12. Alternatively, if 
additional equipment were provided, three small areas 
could be sensed simultaneously. The data derived from each 
area would be utilized to provide an authenticity signal, 
and the three authenticity signals would be utilized, for 
instance using a majority voting procedure, to categorize 
the document as authentic if at least two of the signals 
were indicative of an authentic document. This modification 
will result in an increased amount of data to be analysed 
by the discriminant analysis procedure, but more reliable 
results may be achieved. In yet another modification, the 
document 12 may be sensed while it is moving. This will 
require an appropriate control of the photodiode array 30 
to provide signals corresponding to a desired small area or 
areas to be sensed. In another modification, light could be 
directed towards and/or sensed from the document by using 
optical fibres. 
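
The majority voting procedure mentioned for the multi-area 
modification may be sketched as follows. The function name 
majority_authentic is hypothetical, and each per-area result is 
assumed to be a boolean authenticity signal obtained as described 
for a single area. 

```python
def majority_authentic(area_results):
    """Return True if more than half of the per-area signals are authentic."""
    if not area_results:
        raise ValueError("at least one area result is required")
    votes = sum(1 for result in area_results if result)
    return votes * 2 > len(area_results)


# Three sensed areas: two authentic signals are enough to accept.
accepted = majority_authentic([True, True, False])
rejected = majority_authentic([True, False, False])
```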



Claims 

1. A method of determining the authenticity of a 
document (12), characterized by the steps of: dispersing 
light derived from an area (22) of said document (12) 
into a spectrum, generating a plurality of electrical 
signals representing light intensity values in a 
corresponding plurality of spectral wavebands in said 
spectrum, storing data representing said electrical 
signals, and analyzing the stored data by discriminant 
analysis to determine the authenticity of said document. 

2. A method according to claim 1, characterized in that 
said step of analyzing the stored data includes the steps 
of: calculating discriminant function values utilizing 
the stored data; determining a distance measurement 
representing the distance between the calculated 
discriminant function values and reference discriminant 
function values, and determining said document as 
authentic if said distance measurement is less than a 
predetermined threshold value. 

3. A method according to claim 1 or claim 2, 
characterized in that said distance measurement is a 
Euclidean distance measurement. 

4. A method according to claim 1 or claim 2, 
characterized in that said distance measurement is a 
Mahalanobis distance measurement. 

5. A method according to any one of claims 2 to 4, 
characterized in that said reference discriminant 
function values correspond to centroid discriminant 
function values derived from genuine documents. 

6. A method according to any one of the preceding 
claims, characterized by the step of utilizing a 
plurality of areas on said document (12) to generate data 
representing light intensity values in said plurality of 
spectral wavelengths. 

7. Apparatus for determining the authenticity of a 
document (12), characterized by light dispersing means 
(26) adapted to disperse light derived from an area of 
said document (12) into a spectrum, light sensing means 
(30) adapted to provide signals representing light 
intensity values in a plurality of spectral wavebands in 
said spectrum, storage means (42) adapted to store data 
representing said electrical signals, and analyzing means 
(44) adapted to analyze said data using discriminant 
analysis, and to provide an output signal representing the 
authenticity of said document (12). 

8. Apparatus according to claim 7, characterized by 
analog-to-digital converter means (38) adapted to convert 
said signals representing light intensity values to 
digital form for storage in said storage means (42). 

9. Apparatus according to claim 7 or claim 8, 
characterized in that said light dispersing means 
includes a spectroscope (26). 